Abstract

A quantitative understanding of human behavior provides a basic grasp of the
complexity of many human-initiated systems. In this paper, we investigate the
behavior of people on BBS forums through a statistical analysis of the numbers of
views and replies that posts receive. We find that the numbers of views and replies
both follow power-law distributions, with different power exponents.
Furthermore, we find that the numbers of views and replies are nonlinearly
related: they are connected by a power function and appear as a straight line in a log-log
plot. From the estimated slope and intercept of this line, we can characterize the
behavior quantitatively, and we find that users of a Chinese forum and those of a foreign
forum have different preferences for replying to and viewing posts. Finally, we
analyze the burstiness and memory of the replying time series. These show some universal
properties across the different forums: all of them lie in the high-B, low-M
region.

Keywords: Human Dynamics, BBS forum, Burstiness and Memory

PACS: 89.75.Da - Systems obeying scaling laws; 89.20.Hh - World Wide Web, Internet;
89.65.-s - Social and economic systems



1 Introduction 

Human behavior has been a subject of scientific study for about a century, dating back to
Watson [1]. As a joint interest of sociology, psychology and economics, it has been
extensively investigated over recent decades. A basic assumption embedded
in previous analyses of human dynamics is that the temporal statistics are uniform
and stationary and can therefore be described by a Poisson process. Accordingly, the
interevent time distribution should have an exponential tail [2]. However, by studying
the interevent time distributions of activities such as sending and replying to
e-mail and surface mail, Albert Barabasi discovered that these behaviors
deviate markedly from a Poisson process: long periods of silence alternate with short
bursts of intense activity, and the interevent time distributions
have power-law fat tails [3].

From e-mail communications [3] [4] to surface mail communications [5] [6] [7] and short
message communications [8], from financial activities to library loans, and from web browsing
[9] to on-line movie watching [10], more and more examples show that the interevent time
distributions of human behavior cannot be described by a Poisson process but are
well fitted by a power function. From these previous studies we find that the methodology of
human dynamics, namely extracting statistical laws from historical records of human
activities, can be applied to other issues. For instance, some of those
datasets can also be used to quantify the herd behavior of an individual, that is, the tendency
to follow irrationally the opinions of the majority of people in his or her social surroundings.

With the development of information technology, the Internet has become part of
people's daily life, including e-mail communication, dealing with financial matters, reading
news online, browsing the web, downloading software and so on. In the cyber world, the BBS
forum provides people with a platform for mutual communication. Behaviors such as "lurking,
spamming, sofast", etc. are the ways in which netizens publicly express their ideas. This
means of expression is distinctive in combining "openness" with "concealment", which is why
the BBS forum is popular with the majority of Internet users.

On a BBS forum the numbers of threads and users keep growing, and users
can browse information and express their own views. In doing so they form the most
common kind of group on the forum: the users who browse posts on the same subject form a
provisional group, and different themes have different groups. Within a provisional group,
some people simply browse, while others publish their own opinions; this gives rise to the
numbers of views and replies of each post. How the groups differ, and how these
differences are reflected in the data, are the questions of interest here. In
this paper we address them by studying the actions of Internet users on BBS forums.
The data come from the Sina Forum and the ChatAvenue Forum; the former is
the largest Chinese community and the latter is the current leader in free online chat
rooms.

Our paper is organized as follows. In Sec. 2 we show that the numbers of views and
replies of posts follow power-law distributions with different power exponents.
In Sec. 3 we address the nonlinear relationship between the numbers of views and replies
and calculate the fitted line that characterizes each forum; from the fitted lines
we identify differences between the forums. In Sec. 4 we analyze the burstiness
and memory of the replying time series.






2 The amounts of view and reply 



With the help of a computer program, we collected the numbers of views and
replies of posts on different themes. The data come from the Sina Community
Forum and the ChatAvenue Forum.

Sina Community Forum is the world's largest Chinese community and one of the most
comprehensive and best-known BBSs on the Internet. It has a huge core group of users, and its
boards cover culture, life, society, current affairs, sports, entertainment and other areas [11]. We
obtained the numbers of views and replies of posts on entertainment, sports, life and economics
themes from the Sina Community Forum.

Economic Commentary: From April 17, 2001 to November 11, 2008, there were 32,480 
posts. Pet Club: From December 1, 2005 to November 17, 2008, there were 39,950 posts. 
China Football: From June 14, 2001 to July 13, 2008, there were 95,891 posts. Pastime 
Puzzles: From November 19, 2004 to July 8, 2008, there were 49,346 posts. 




[Figure 1 panels: China Football, Pastime Puzzles]

Figure 1: The cumulative distribution functions P(x) and their maximum likelihood
power-law fits for the numbers of views and replies of posts from Sina Community Forum. The
red lines show the cumulative distribution of replies and the blue lines show that
of views.

ChatAvenue is the current leader in free online chat rooms. Most of its users are from the
USA, Canada, Australia and the United Kingdom. It also offers a very active message forum
with over 120,000 members, an arcade and a photo gallery [12]. We obtained the numbers of views
and replies of posts from the World Events-Politics and News, Jokes, General, and Graphics
forums.

World Events-Politics and News: From June 10, 2006 to February 23, 2009, there were
1,558 posts. Jokes: From June 10, 2006 to February 25, 2009, there were 1,583 posts.
General: From June 10, 2006 to March 3, 2009, there were 1,250 posts. Graphics Forum:
From June 10, 2006 to March 13, 2009, there were 1,131 posts.

Judging by the numbers of posts collected from the Sina Forum and the ChatAvenue Forum,
the Chinese forum boards are considerably more active than the foreign ones.




[Figure 2 panels: World News, Jokes, General, Graphics]

Figure 2: The cumulative distribution functions P(x) and their maximum likelihood
power-law fits for the numbers of views and replies of posts from ChatAvenue Forum. The red
lines show the cumulative distribution of replies and the blue lines show that of
views.

Table 1: The values of the power exponent α and x_min for different themes

Forum        Theme                 View α   View x_min   Reply α   Reply x_min
Sina         Economic Commentary   2.05     256          2.91      57
Sina         Pet Club              1.94     207          2.66      13
Sina         China Football        1.87     119          3.06      60
Sina         Pastime Puzzles       1.62     92           2.51      46
ChatAvenue   World News            2.57     382          2.81      26
ChatAvenue   Jokes                 3.5      213          3.31      7
ChatAvenue   General               2.52     570          2.5       31
ChatAvenue   Graphics              1.98     175          3.22      49

Table 2: The values of the power exponent α and x_min for other themes from Sina Forum

Forum   Theme                          View α   View x_min   Reply α   Reply x_min
Sina    Arrange Finance                2.07     232          2.12      9
Sina    Education Viewpoint            1.61     1121         1.92      3
Sina    Postgraduate Communication     2.04     2372         2.13      5
Sina    Diet Forum                     1.92     493          3.28      17
Sina    08 European Cup                2.05     162          2.55      35
Sina    Film Television World          1.81     1144         2.74      94
Sina    People Gossip                  2.12     896          2.38      38
Sina    The ebb and flow of Overseas   1.93     394          2.69      47
Sina    Words on World                 2.09     13689        2.18      8

A power-law distribution is observed when we inspect the total numbers of views and
replies a post receives (see Fig. 1 and Fig. 2), indicating that the vast majority of posts
generate little interest, while a few posts are highly popular. At the same time, the
distributions of views and replies have different power exponents. In most cases the
exponent for replies is larger than that for views, although the numbers of replies are
smaller than those of views (see Table 1). The power exponents α range from 1.6 to 3.5. The
values of the power exponent and x_min are obtained with the method of Ref. [13].
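
As a rough illustration of how an exponent such as those in Table 1 can be estimated once
x_min is fixed, the sketch below uses the standard continuous maximum likelihood estimator
α = 1 + n / Σ ln(x_i / x_min). It is a minimal sketch under stated assumptions, not
necessarily the exact procedure of Ref. [13]: the selection of x_min is not shown, the data
are synthetic rather than forum data, and the function names are hypothetical.

```python
import math
import random


def fit_alpha(values, x_min):
    """Continuous maximum likelihood estimate of the power-law exponent.

    Only observations with value >= x_min enter the fit.
    Returns (alpha, number of observations used).
    """
    tail = [v for v in values if v >= x_min]
    if not tail:
        raise ValueError("no observations at or above x_min")
    log_sum = sum(math.log(v / x_min) for v in tail)
    if log_sum == 0.0:
        raise ValueError("all tail observations equal x_min")
    return 1.0 + len(tail) / log_sum, len(tail)


def sample_power_law(alpha, x_min, size, rng):
    """Draw synthetic samples with density proportional to x**(-alpha), x >= x_min."""
    # Inverse-transform sampling: x = x_min * (1 - u)**(-1 / (alpha - 1)).
    return [x_min * (1.0 - rng.random()) ** (-1.0 / (alpha - 1.0)) for _ in range(size)]


# Synthetic "view counts" (assumption: alpha and x_min borrowed from one row of Table 1).
rng = random.Random(0)
synthetic_views = sample_power_law(alpha=2.05, x_min=256.0, size=50_000, rng=rng)
alpha_hat, n_tail = fit_alpha(synthetic_views, x_min=256.0)
print(f"estimated alpha = {alpha_hat:.2f} from {n_tail} observations")
```

For integer-valued counts with small x_min, the continuous estimator is only an
approximation, so a discrete estimator may be preferable in that regime.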

From Table 1 we can also see that the view exponents of the ChatAvenue Forum
are mostly larger than those of the Sina Community Forum, whereas the reply exponents do not
differ much. This shows that the view counts of posts on the Sina Community Forum span a
wider range, which may be related to the forum rules: posts that attract more attention are
pinned to the top, and the forum also recommends some outstanding posts to its users.

We also studied other themes from the Sina Forum, and all of them have a similarly broad
distribution. Table 2 lists their values of the power exponent and x_min.
They share the same characteristics: the values of the power
exponent range from 1.6 to 3.5, and the power exponents of views are smaller than those of
replies.

This raises some questions: why do the numbers of views and replies both follow power laws
but with different power exponents, and what relationship exists between views and
replies? We address these questions in the next section.
[Figure 3 panels: Economic Commentary, Pet Club, China Football, Pastime Puzzles,
World News, Jokes, General, Graphics]

Figure 3: The relationship between the numbers of views and replies is approximately
linear in log-log coordinates.



3 The relationship of views and replies

The relationship between the numbers of views and replies can be observed in
Fig. 3. It is nonlinear and can be described by Eq. (1):

$$\log y = A + B \log x \tag{1}$$

where $y$ stands for the number of replies and $x$ stands for the number of views. From the
analysis of Sec. 2 we know that the numbers of views and replies can both be described by
power-law distributions. We therefore suppose that the probability density function of views is

$$f(x) = c_1 x^{-r_1} \tag{2}$$

and that the probability density function of replies is

$$f(y) = c_2 y^{-r_2} \tag{3}$$

where $r_1$ and $r_2$ denote the power exponents of views and replies, respectively.

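From Eq. (1) and Eq. (2), using the change of variables $f(y) = f(x)\,|dx/dy|$ with
$x = e^{-A/B} y^{1/B}$, we get

$$f(y) = \frac{c_1}{B}\, e^{\frac{A(r_1-1)}{B}}\, y^{-\left(1+\frac{r_1-1}{B}\right)} \tag{4}$$

Comparing with Eq. (3), we find that

$$c_2 = \frac{c_1}{B}\, e^{\frac{A(r_1-1)}{B}} \tag{5}$$

$$r_2 = 1 + \frac{r_1-1}{B} \tag{6}$$

From Eq. (5) and Eq. (6) we obtain

$$B = \frac{r_1-1}{r_2-1} \tag{7}$$

$$A = \frac{1}{r_2-1}\left(\log B + \log c_2 - \log c_1\right) \tag{8}$$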
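
The relation between the slope and the two exponents can be checked numerically against the
tabulated values. The sketch below assumes that Eq. (7) has the form B = (r1 - 1)/(r2 - 1),
with r1 and r2 the view and reply exponents, and evaluates it for the exponents listed in
Table 1. The dictionary layout and the function name are illustrative only.

```python
# View exponent r1 and reply exponent r2 per theme, copied from Table 1.
EXPONENTS = {
    "Economic Commentary": (2.05, 2.91),
    "Pet Club": (1.94, 2.66),
    "China Football": (1.87, 3.06),
    "Pastime Puzzles": (1.62, 2.51),
    "World News": (2.57, 2.81),
    "Jokes": (3.5, 3.31),
    "General": (2.52, 2.5),
    "Graphics": (1.98, 3.22),
}


def slope_from_exponents(r1, r2):
    """Slope B of the log-log line, assuming B = (r1 - 1) / (r2 - 1)."""
    if r2 == 1.0:
        raise ValueError("r2 must differ from 1")
    return (r1 - 1.0) / (r2 - 1.0)


slopes = {theme: slope_from_exponents(r1, r2) for theme, (r1, r2) in EXPONENTS.items()}
for theme, slope in slopes.items():
    print(f"{theme}: B = {slope:.3f}")
```

Under this assumption, seven of the eight themes reproduce the B column of Table 3 to within
rounding; for World News the formula gives about 0.87, whereas the tabulated value is 0.967.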

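Here $r_1$ and $r_2$ are known quantities, so it remains to evaluate the unknown quantities
$c_1$ and $c_2$. For a generic count $z$ (views or replies), the probability density function
takes the power-law form only within a finite range,

$$f(z) = c\, z^{-r}, \quad z_{\min} \le z \le z_{\max} \tag{9}$$

with a different form for other values of $z$. Since the power-law range contains a fraction
$n/N$ of the observations, the normalization of the probability density function gives

$$\sum_{z=z_{\min}}^{z_{\max}} c\, z^{-r} = \frac{n}{N} \tag{10}$$

from which we can evaluate the value of $c$:

$$c = \frac{n}{N} \cdot \frac{1}{\sum_{z=z_{\min}}^{z_{\max}} z^{-r}} \tag{11}$$

where $n$ stands for the number of observations with $z_{\min} \le z \le z_{\max}$, $z_{\min}$
is taken from Table 1, $z_{\max}$ is the largest value in the series of views or replies, and
$N$ stands for the total number of observations. Therefore, we can evaluate $c_1$ and $c_2$:

$$c_1 = \frac{n_1}{N} \cdot \frac{1}{\sum_{x=x_{\min}}^{x_{\max}} x^{-r_1}} \tag{12}$$

$$c_2 = \frac{n_2}{N} \cdot \frac{1}{\sum_{y=y_{\min}}^{y_{\max}} y^{-r_2}} \tag{13}$$

Substituting these into the expression for $A$, we get

$$A = \frac{1}{r_2-1}\left(\log\frac{r_1-1}{r_2-1} + \log\frac{n_2}{n_1} + \log\frac{\sum_{x=x_{\min}}^{x_{\max}} x^{-r_1}}{\sum_{y=y_{\min}}^{y_{\max}} y^{-r_2}}\right) \tag{14}$$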



According to Eq. (1), the fitted line is determined by the parameters $A$
and $B$. For the same number of views, the larger the value of $B$, the more
people reply to the post. The value of $e^{A}$ gives the probability that the first viewer
replies to the post, and the value of $e^{-A/B}$ gives the number of people who have viewed
the post before it receives its first reply.
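
To make the roles of A and B concrete, the sketch below evaluates the fitted relation
y = e^A x^B, which follows from Eq. (1) if natural logarithms are assumed, for the Economic
Commentary values of Table 3 (A = -2.74, B = 0.550). The function names are hypothetical and
the choice of 1000 views is arbitrary.

```python
import math


def expected_replies(views, a, b):
    """Replies predicted by log y = A + B log x, i.e. y = exp(A) * views**B."""
    return math.exp(a) * views ** b


def views_at_first_reply(a, b):
    """Number of views x at which the predicted replies reach 1: x = exp(-A / B)."""
    return math.exp(-a / b)


A, B = -2.74, 0.550  # Economic Commentary, Table 3
threshold = views_at_first_reply(A, B)
print(f"views before first reply: {threshold:.0f}")
print(f"predicted replies at 1000 views: {expected_replies(1000, A, B):.2f}")
```

By construction, the predicted number of replies at the threshold number of views equals one,
which is the sense in which e^(-A/B) marks the first reply.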

In terms of the above analysis, Table 3 shows that for the same
number of views, the groups in the ChatAvenue Forum reply more than the groups in the
Sina Forum: most people in the Sina Forum prefer viewing posts to replying to them. The
differences between the ChatAvenue Forum and the Sina Forum possibly come from their
different cultural and social environments.
Table 3: The values of A, B and -A/B

Forum        Theme                 A        B       -A/B
Sina         Economic Commentary   -2.74    0.550   4.98
Sina         Pet Club              -3.35    0.566   5.92
Sina         China Football        -3.44    0.422   8.15
Sina         Pastime Puzzles       -3.70    0.411   8.39
ChatAvenue   World News            -5.17    0.967   5.35
ChatAvenue   Jokes                 -7.83    1.08    7.25
ChatAvenue   General               -3.53    1.01    3.50
ChatAvenue   Graphics              -0.456   0.441   1.03

Table 4: The values of B and M

Forum        Theme                          B        M
Sina         People Gossip                  0.8243   0.0378
Sina         The ebb and flow of Overseas   0.7856   0.0457
Sina         Words on World                 0.7296   0.01108
ChatAvenue   World News                     0.8897   0.0057
ChatAvenue   Jokes                          0.8337   0.0057
ChatAvenue   Graphics                       0.8258   0.0348

4 Burstiness and memory 

The dynamics of a wide range of real systems are bursty, characterized by short
periods of intense activity followed by long periods of reduced or no activity. In Ref. [13],
K.-I. Goh and A.-L. Barabasi propose to characterize the bursty nature of real signals
using orthogonal measures quantifying two distinct mechanisms leading to burstiness: the
interevent time distribution (see Eq. (15)) and the memory (see Eq. (16)).

$$B = \frac{\sigma_\tau - m_\tau}{\sigma_\tau + m_\tau} \tag{15}$$

where $m_\tau$ and $\sigma_\tau$ are the mean and the standard deviation of $P(\tau)$, respectively.

$$M = \frac{1}{n_\tau - 1} \sum_{i=1}^{n_\tau - 1} \frac{(\tau_i - m_1)(\tau_{i+1} - m_2)}{\sigma_1 \sigma_2} \tag{16}$$

where $n_\tau$ is the number of interevent times measured from the signal, and $m_1$ ($m_2$) and
$\sigma_1$ ($\sigma_2$) are the sample mean and sample standard deviation of the $\tau_i$'s
($\tau_{i+1}$'s), respectively ($i = 1, \ldots, n_\tau - 1$).
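
The two measures are straightforward to compute from a list of event times. The following is
a minimal sketch assuming the definitions B = (σ - m)/(σ + m) for the interevent times and M
as the correlation coefficient of consecutive interevent times, as in Eq. (15) and Eq. (16).
The function names and the toy timestamps are hypothetical, and population standard
deviations are used for simplicity so that M stays within [-1, 1].

```python
import statistics


def interevent_times(timestamps):
    """Differences between consecutive event times (input need not be sorted)."""
    ordered = sorted(timestamps)
    return [later - earlier for earlier, later in zip(ordered, ordered[1:])]


def burstiness(taus):
    """B = (sigma - mean) / (sigma + mean) of the interevent times."""
    mean = statistics.fmean(taus)
    sigma = statistics.pstdev(taus)
    return (sigma - mean) / (sigma + mean)


def memory(taus):
    """M: correlation between consecutive interevent times tau_i and tau_(i+1)."""
    first, second = taus[:-1], taus[1:]
    m1, m2 = statistics.fmean(first), statistics.fmean(second)
    s1, s2 = statistics.pstdev(first), statistics.pstdev(second)
    if s1 == 0.0 or s2 == 0.0:
        raise ValueError("memory is undefined for constant interevent times")
    products = [(a - m1) * (b - m2) for a, b in zip(first, second)]
    return sum(products) / (len(products) * s1 * s2)


# Perfectly regular replies: every gap is 10, so sigma = 0 and B = -1.
regular = interevent_times([0, 10, 20, 30, 40])
print(burstiness(regular))

# Irregular replies: a few long gaps between clusters of short ones.
irregular = interevent_times([0, 1, 2, 50, 51, 52, 53, 200, 201, 400])
print(burstiness(irregular), memory(irregular))
```

With these definitions B lies between -1 (perfectly regular signal) and 1 (extremely bursty
signal), and M near zero means that consecutive interevent times are almost uncorrelated.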

The times of posting and replying can be treated as a time signal, from which we can easily
calculate the burstiness of the interevent time distribution and the memory (see Table 4).

In Table 4, the themes People Gossip, The ebb and flow of Overseas and Words
on World belong to the Sina Community Forum, and the themes World News, Jokes and
Graphics come from the ChatAvenue Forum. Table 4 shows that the values of B and M
differ little across themes, which further indicates that the
replying behaviors on the Sina Community Forum and the ChatAvenue Forum are similar. In all
cases the replying time series is a bursty signal with little memory; in other words,
earlier repliers have little influence on later ones, and people reply to a post
according to their own interests.

5 Conclusion and discussion 

In this paper we found that the numbers of views and replies on BBS forums follow
power-law distributions, with power exponents ranging from 1.6 to 3.5. Moreover, in most cases
the power exponent for replies is larger than that for views. Furthermore, we found that the
numbers of views and replies are nonlinearly related. Based on this relationship,
we presented two parameters, the slope and the intercept of the straight line in the log-log
plot, which can be used to evaluate people's interest in replying versus viewing. With these
measurements and statistics, we found some differences between the Sina Community Forum and
the ChatAvenue Forum, which may be caused by cultural and social environments and by forum
rules. Finally, we analyzed the burstiness and memory of the replying time series. All the time
series have similar qualitative properties: the Sina Community Forum and the ChatAvenue
Forum do not differ much in their replying time series.